AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
EN

AI News

View More

Moonshot Launches Kimi Linear Model: Processing Long Contexts 2.9 Times Faster

The Moonshot team has launched the Kimi Linear model, achieving a technological breakthrough in the AIGC field. The model uses a hybrid linear attention architecture, improving the speed of processing long contexts by 2.9 times and decoding speed by 6 times. Its performance surpasses the traditional Softmax attention mechanism, showing excellent results particularly in scenarios such as context processing and reinforcement learning.

15.4k 1 days ago
Moonshot Launches Kimi Linear Model: Processing Long Contexts 2.9 Times Faster

Models

View More

Qwen3-Next-80B-A3B-Instruct

Alibaba

Qwen3-Next-80B-A3B-Instruct

$2

Input tokens/M

-

Output tokens/M

256

Context Length

Qwen3-0.6B

Alibaba

Qwen3-0.6B

$0.3

Input tokens/M

-

Output tokens/M

32

Context Length

o1-pro

Openai

o1-pro

-

Input tokens/M

-

Output tokens/M

-

Context Length

MiniMax Text 01

Minimax

MiniMax Text 01

$1

Input tokens/M

$8

Output tokens/M

128

Context Length

Qwen_v2.5_3b_Instruct

Alibaba

Qwen_v2.5_3b_Instruct

$1

Input tokens/M

-

Output tokens/M

32

Context Length

Yi-Lightning

01-ai

Yi-Lightning

$0.99

Input tokens/M

$0.99

Output tokens/M

32

Context Length

CogView-3-Plus

Chatglm

CogView-3-Plus

-

Input tokens/M

-

Output tokens/M

-

Context Length

AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map